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VIRTUAL MACHINE MEMORY MANAGEMENT 

FIELD OF INVENTION 

This invention relates to memory management in a 
multithreaded runtime environment. 

BACKGROUND OF INVENTION 

The Java programming language has its origins in a 
project undertaken by Sun Microsystems to develop a 
robust programming environment that would meet the 
technical challenges of the consumer device software 
environment. The original consumer device projects were 
eventually abandoned but the Java programming language 
found itself being used on the World Wide Web to enable 
cross platform operation of programs downloaded from the 
internet. It is simple to use having similar features to 
C++ such as the basic object orientated technology but 
without some of the more complex features. 

Typically, Java applications (source code) are 
compiled by the Javac compiler into Java byte code 
(intermediary code or pseudo object code) which can be 
loaded and executed by a Java Virtual Machine (JVM) (see 
Figure 1) . The JVM provides an instruction set, memory 
management and program loading capability that is 
independent of the hardware platform on which it has been 
implemented. The Java application source code is 
compiled into architecture independent byte code and the 
byte code is interpreted by a JVM on the target platform. 
Java is designed to be portable and follows some defined 
portability standards, which intend the source code to be 
"write once, run anywhere". The Java byte code may be 
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further compiled into machine code (object code) for the 
target platform at which point the architectural 
independent nature of Java is lost. 



it simulates a hardware machine that processes Java byte 
code. The byte code is interpreted and processed by a 
JVM such as an Windows JVM running on a Intel personal 
computer platform. The JVM includes components for 
loading class files, interpreting the byte code, garbage 
collecting redundant objects, and for managing multiple 
processing threads. The JVM may also include a 
Just-In-Time compiler to transform some or all the byte 
code into native machine code. 

Multithreading is a feature built into the Java 
language to allow users to improve interactive 
performance by allowing operations to be performed while 
continuing to process user actions. Multithreading is 
similar to multitasking, but whereas multitasking allows 
many applications to run on the same system in several 
processes, multithreading allows many routines (threads) 
in one application to potentially run in parallel within 
one process . 

Garbage collection is the term used for describing 
how program objects are automatically discarded by the 
system after they are no longer useful . 

For further information on garbage collection see 
Chapter 1 of 'Garbage Collection' by H Jones & R Lins, 
Wiley. Chapter 4 deals with Mark & Sweep techniques. 



Many current implementations of Java use the classic 



delivered in the baVe SUN JVM. References to the objects 



The JVM is a software computing machine, effectively 




method of garbage collection as 
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that are being processed at any instant by the system are 
stored in the registers, one or more thread stacks and 
some global variables. The totality of objects that are 
maybe needed by the system can be found by tracing 
through the objects directly referenced in the registers, 
stacks, and global variables and then tracing through 
these "root" objects for further references. The objects 
in use by a system thereby form a graph rooted at the 
root ob j ects and any extraneous ob j ects are not part of 
this graph. Once all the objects in the graph are found, 
the remaining objects in the heaps may be discarded 
(garbage collected) . 

The traditional mark and sweep garbage collection 
method is described below in terms of pseudo code with 
respect to a single heap: 

Stop all threads causing the active registers for 

each thread to be stored in its stack 

Trace all stacks for object references - the local 

roots 

Trace all global variables object references - the 
global roots 

Trace through root set for references until no new 
object references (the sum of the local and global 
roots is the root set) . 

Delete all objects in the single heap that are not 
referenced 

Note that in a particular Java virtual machine the term 
'global variables' may include variables in classes, the 
global JNI table, objects awaiting finalisation and 
strings which have been 'interned'. 
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There are problems with this technique in a 
multi-threaded and long running environment. The first 
problem is that in order to garbage collect all the 
threads must be stopped in order to work out what objects 
are unreachable (i.e., there are no pointers to them in 
the global or local variables and no pointers to them in 
other reachable objects) . Various authors have attempted 
to solve this problem. One approach is an on-the-fly 
collector, which does not stop all threads, however, it 
cannot compact the reachable objects leading to 
fragmentation. Another approach are the generational 
scavenging schemes, which is to reduce the size of the 
set of traced objects by concentrating effort on the most 
recently allocated objects; however, these schemes must 
stop all of the threads. In an ideal world we would like 
to achieve a collector which works independently on all 
threads and compacts the local heap of the threads to 
maximise the free space available. 

Another solution attempts to achieve this in a 
language (ML) .other than Java by taking advantage of 
immutable objects which can be placed in thread-local 
heaps. An immutable object is non-modifiable and when 
such an object become reachable globally a copy of the 
object can be made in the global heap. Clearly this 
technique is only applicable to languages defining 
immutable objects. 

Another approach moves an object into the global 
heap on first use. The difficulty here is that in order 
to move the object, references from elsewhere to it must 
be updated; in an environment where objects are 
referenced by handles this is made easier although there 
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are still cases' where objects cannot be moved. 
Unfortunately handles bring their own problems and the 
IBM ports of the JVM have removed handles to improve 
performance and remove the need to subdivide the heap 
into handles and object spaces. 

SUMMARY OF INVENTION 

According to one aspect of the invention there is 
provided a method of managing memory in a multi -threaded 
processing environment including respective local thread 
stacks and heaps and a global heap, said method 
comprising : 

creating an object in a thread heap; and monitoring 
whether the object is reachable from anywhere other than 
the thread stack or registers. 

Preferably the method further comprises : associating 
a local status with the obj ect ; and changing the status 
of the object to global under certain conditions. 

More preferably the method further comprises 
deleting from the thread heap one or more local objects 
when they are not reachable from a local root . 

Advantageously where reachability is determined by 
tracing from the local root. 

More advantageously the status of an object in the 
thread heap is changed to global if the object is 
assigned to a static variable or if the object is 
assigned to a field in a global object . 

Hence this proposal concentrates on the case of 
implementing a thread local heap which has not been done 
before. In essence the solution is to keep track of 
object references using write barriers placed in any 
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operation which assigns references eg putfield, putstatic 
and aastore. 

The putfield operation for example is a java 
bytecode which causes the value on the top of the stack 
to be placed in the object referenced below it on the 
stack. The object field to be updated is defined by the 
constant pool reference (2 bytes) which follows the 
bytecode. See p. 325 'Java Virtual Machine 
Specification', Lindholm & Yellin 1997. 

A flag is associated with each object in the heap. 
In practice the flag is easily provided by ensuring that 
all objects on the heap are allocated on an 8 byte 
boundary ensuring that the low three bits are 0. Two of 
these bits are currently used to denote pinned or free 
objects leaving a spare flag which we use to denote 
'global'. When an object is allocated it is generally 
placed in a piece of storage associated with the 
allocating thread; the thread local heap. The global 
flag is unset at this point. From now on the write 
barriers implement the following rules: 

1) if an object reference is assigned to a static 
(global) variable the object becomes global and all 
objects reachable from it also become global; and 

2) if a reference to a non-global object is 
assigned to another object which is global then the 
referenced object and all objects that are reachable from 
it also becomes global. 

Thus at any point the thread local heap will contain 
objects which have the global flag unset and some which 
have the flag set. Any objects with the flag unset can 
be garbage collected by the thread itself. The objects 
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marked global must at this stage be treated as reachable, 
pinned objects; they must not be moved or deleted. Any 
garbage collection policy can be adopted including 
compaction for the objects in the local heap, the only 
constraint is that we must of course compact into the 
potentially fragmented spaces left between the global 
objects. Note that the write barriers described above are 
extremely simple and can be implemented to have a very 
small effect on performance. 

At some point the global objects will be moved from 
the local heaps into the global heap and this must be 
done using the current collection approach of stopping 
all threads. The belief is of course that the frequency 
of such events can be dramatically reduced and the work 
to be done also reduced. The global objects need to be 
removed and put in their own heap, once that is done the 
individual threads can be left to continue any tidying 
they need to do on their own heaps. The great 
improvement is that the moving of global objects is the 
only part which must be done whilst all program threads 
are stopped. Tidying of the local heaps can be then done 
in parallel. The current approach forces the compaction 
of the entire heap (moving objects) to be done by one 
thread and all others must wait for completion. 

BRIEF DESCRIPTION OF DRAWINGS 

In order to promote a fuller understanding of this 
and other aspects of the present invention, an embodiment 
will now be described, by way of example only, with 
reference to the accompanying drawings in which: 
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Figure 1 is a schematic representation of a platform 
supporting the embodiment of the present invention; 

Figure 2 is a schematic representation of a Java 
Virtual Machine embodying the invention; 

Figure 3 shows the structure of objects in a thread 

heap ; 

Figure 4 is a flow diagram depicting the memory 
management process of the invention; 

Figure 5 is a flow diagram of the process of the 
write barrier; and 

Figure 6 is a schematic representation of objects in 
a thread heap and the global heap of the JVM. 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENT 

Referring to Figure 1 there is shown a computer 
platform 10 such as pentium processor based PC typically 
comprising motherboard with processor, 32Mbytes RAM 
memory, and interface circuitry; 4G Byte hard drive; 20x 
CD ROM; SVGA moridtor; keyboard and mouse. On power on the 
platform loads iAto the memory an operating system 12 
such as Windows NT v4 and subsequently a Java Virtual 
Machine (JVM) bas^d on the Java Development Kit (JDK) 
vl.l or vl.2. The jWa virtual machine is capable of 
loading a Java application 16 from disk into memory so 
that the application fnay be executed on the platform. The 
Java application 16 is\object orientated program code and 
creates Java language oVjects 18A, 18B, 18C, 18D. An 
example of how the objects are built by the JVM is 
described with reference \o Figure 3. 
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The JVM 14 comprises code components for the 
multitude of tasks it has to perform (see Figure 2) . The 
components of relevance to the present embodiment 
comprise memory manager 20, Java interpreter 2 2 and 
just-in-time compiler 24. In operation the JVM creates 
memory space, the runtime data area 26, for storing the 
objects 18 and data for the application 16 . 

The runtime data area 2 6 comprises thread memory 
space 28A, 28B,\28n, global heap 34, class area 36 and 
compiled method \rea 38. The thread memory space 28 
ideally stores 1 octal thread objects, local thread 
execution data and\also objects marked global. Local 
objects are objects V/hich are used by one thread alone 
and are not common between. Thread objects are stored in 
thread heaps 32A, 32B,\32n and thread execution data 
including object references and register data are stored 
in the thread stacks 30a\ 30B, 30n. The global heap 34 
stores objects which are common to more than one thread, 
that is global objects. AVLocal object which is loaded 
into a thread heap 32 A for processing by thread 2 8A may 
become global if it is lateAused by another thread, say 
thread 2 8B, and may remain in\the thread heap 2 8A or be 
moved to the global heap 34 by^ 'stop everything'. Class 
area 3 6 stores the classes as they are loaded into the 
JVM, classes are defined as global because they are 
common to all the threads hence any field of a class is 
globally reachable. An object, which is an instance of a 
class, is placed in a thread heap and is initially local 
because it may only be used by that thread. An exception 
is instance objects of class Thread or Runnable which are 
global at creation because they can be reached by other 
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threads. Whereas an object referenced by a variable in a 
class is global because the object is reachable from the 
class which is reachable by all threads. The compiled 
method area 38 is used by the JIT compiler 24 to store 
native executable code compiled from the Java byte code. 

The memory manager 2 0 includes an object allocator 
21A and a garbage collector 21B which between them 
control the creation and deletion of objects within the 
thread heaps 32A, 32B, 32n and the global heap 34. 

The Java interpreter 22 includes functionality to 
process the multitude of Java byte codes which make up 
the Java byte code function set. A write operation 
component 23A processes java write operations such as 
'putfield', 'putstatic', 'aastore' using the operating 
system and microprocessor of the platform. Putfield and 
aastore are write operations which among other things 
assign a source object reference to a field in a target 
object. This means that the target object will contain 
a reference (or pointer) to the source object; the target 
object is the object that will be updated by the putfield 
operation. Putstatic assigns an object reference to a 
static variable in a class. A write barrier component 23B 
checks the status of the target object and assigns a 
local or global status to the source object depending on 
certain rules: 

1) if an object reference (of a source object) is 
assigned to a static variable of a class the source 
object becomes global and all objects reachable from it 
also become global; and 



UK998093 



11 



2) if a non-global source object is assigned to a 
reference in a target object which is global then the 
reference^ source object and all objects that are 
reachable from it become, global. 

The Java write operation 'putfield' may set a field 
within a target object to point at a source object using 
the operating system commands or platform operations . A 
'putfield' write operation 23A together with the write 
barrier 23B is described below in pseudo platform code 
with comments . 

If field to be updated in target object is of 
'reference' type. 

if (target is global) {Check to see to the global 

flag is set for the target 

if (source is not global) {If so check to see 

if the source has 
already been set global 

set source to global {If not already set then 

set source to global 

set objects reachable by source to global 

{and set all objects 
reachable, (i.e., directly 
referenced by source object 
or referenced from another 
object reachable from the 
source object by the source 
object to global 
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set slot ( target , * field number, source) 

{sets a field variable 
within a target object to 
point at a source object. 

The last instruction code is the write operation 23A 
whereas the instructions before that constitute the write 
barrier 23B. Similar pseudo code may be provided for the 
putstatic and aastore java commands, and other JVM 
actions, which store references in objects. 

The JIT compiler 24 is similar to the Java 
interpreter 22 in that it contains a write operation 
component 2 5A and a write barrier component 2 5A. However 
instead of interpreting java byte code in the methods of 
objects and processing the operations in real time, 
native code similar to the pseudo code above is created 
and stored in the compiled method area 38 for execution 
directly by the platform and operating system whenever 
that particular method is invoked. 

Objects 18 are stored in the thread heaps 32 using 
multiples of 8 bytes (see Figure 3) . Each object has a 
length word attached which identifies the length of the 
object in bytes and allows objects in the heaps to be 
scanned sequentially from the start of the heap. The 
length word (4 bytes long in the embodiment) has three 
spare bits available because of the 8 byte alignment 
boundary and one of these bits is used as a flag for the 
object to store the local or global status. In the 
embodiment shown in Figure 3 each of the objects is 40 
bytes long which is 5*8 bytes or '00101000' bytes in 
binary; the last three '000' being spare. This is 
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indicated at the high end of the word in each case. The 
low end of the word is spare and thus the first bit of 
the length word is used to indicate the local/global 
status, in this case object 18A and 18B are set to '1' to 
indicate that they are global and 18C is set to '0' to 
indicate that it is local. 

The proce^te of memory management is described with 
a preference to Fi&ure 4. Step 4 . 1 an object is created from 
a class stored ih the class area 36. Step 4.2 check to 
see if the class lis a 'global' class, i.e., a class all 
of whose instances are global or one whose instances we 
expect to quicklyt become global, whereby the object is 
assigned global aAd/or placed in the global heap (Step 
4.7). Step 4.3 thfe size of the object is calculated to 
see whether it wilA fit in the thread heap. Step 4.4 if 
there is not enough\space then garbage collection is 
performed to free ud memory. Step 4.5 place the object in 
the thread heap memoiry along with the object length in 
multiples of eight. Step 4.6 Use a spare bit in the 
length attribute as a Vlag and set as local ('0'). The 
process ends at step 4 \ 8 . 

The write operation and write barrier process are 
>£v 'described with reference to Figure 5. Step 5.1a write 
operation is called and intercepted by the write barrier 
which is integrated wifch the write operation code. Step 
5.2 Check whether the object is being assigned to a 
static variable of a class\and if so set the status of 
the object as global (step Su4) and set all objects 
reachable by the object as glbbal (step 5.5) before 
moving to step 5.6, if not so tshen do step 5.3. Step 5.3 
Check whether the write operation assigns the object to 
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another glcs^bal object, if so set the status of the object 
as global (Sstep 5.4) and set all objects reachable by the 
object as glcibal (step 5.5). Once complete continue with 
the write operation. 

The garbage collector 21A for single thread garbage 
collection is described in terms of pseudo code with 
comments . 

Trace stack in thread {Identify objects in use by 

the thread - these are held 
in the thread stack 



Trace reference objects {Identify objects 

referenced directly and 
indirectly by the stack 
objects . 

Delete unused objects {those objects in the heap 

which are not identified in 
trace or are not identified 
as global 



Check heap size {if not enough memory then 

increase size of the local 
heap. If not desirable or 
possible then do 'full' 
garbage collection 
Full garbage collection is traditional Mark/Sweep 
performed with all other threads stopped. During full 
garbage collection global objects are moved from local to 
the global heap if possible. Object movement is not 
possible if conservative tracing is performed and the 
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object is referenced directly from the stack because of 
the uncertainty of the tracing. 

The Java language pseudo code program below 
(together with comments) creates objects representing a 
bike with two wheels and a bell in a local thread which 
is put into a global shed. The bike and wheels are make 
global whereas the bell remains local. If a further new 
bike object were to be created and the thread heap had no 
more memory then the garbage collector would mark and 
sweep the bell object so long as it was not pointed to by 
the stack. 

// Demonstrate the Thread local heap in action. 

// For simplicity all niceties such as constructors 
have been omitted and the example 

// made deliberately unreal to illustrate the basic 
point. Objects generally start life 

// local and become global by assignment. 

// This is a fake class just to illustrate a static 
variable in operation. 

class Shed { // This is a fake class just to illustrate 
a static variable in operation. 

static Bike; 

// The main method will run the sample code to 
illustrate creation of local 

// objects which later will become global. 

public static void main (String args [ ] ) { 

Bike b = new Bike(); // build a new Bike object 

// The Bike class is loaded and 
initialised and an instance of Bike is created in the 
local heap. It is marked 'local'. 
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Bell c = new Bell ( ) ; // build a new Bell object. 
The Bell class is loaded and initialised and an instance 
of Bell is created in the local heap. It is marked 
'local' . 

Wheel w = new Wheel () ; // build a new wheel 
object. The Wheel class is loaded and initialised and an 
instance of Wheel is created in the local heap. It is 
marked ' local ' . 

b.wheell = w; // Assign the first wheel of the bike 
Java bytecode putfield, write barrier detects that target 
(b) is local and hence w remains local. 

Shed. bike = b; // Assign b to the static field 

in bike Shed. Java bytecode putstatic is used and thus b 
is marked global. Since w is reachable from b then w is 
also marked global . 

w = new Wheel ( ) ; // build another new wheel 
object. Another instance of Wheel is created in the 
local heap. It is marked 'local'. 

b.wheel2 = w; // Assign the second wheel of the 

bike 

// Java bytecode putfield, write 
barrier detects that target (b) is global and hence w is 
marked global also. 

// Note that at this point we have built a bike, two 
wheels and a bell. The bell is the only object which is 
still local since the bike has been assigned to the 
static variable 'bike' . 
} 

} 

class Bike { 
Wheel wheel 1; 
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Wheel wheel2 ; 
Bell belli; 
String name; 
} 

class Wheel { 
int diameter ; 
} 

class Bell { 
int volume; 
} 

In summary there is described a method and system 
for memory management in a virtual machine or operating 
system and in particular an object creation and garbage 
collection method and system. There is described a method 
and system of managing memory in a multi-threaded 
processing environment such as a java virtual machine 
comprising: creating an object in a thread heap; 
associating a status with the object and setting the 
status as local; using write barriers to change the 
status to global if the object is assigned to a static 
variable or if the object is assigned to a field in a 
global object; changing the status of all objects 
referenced by that object to global; and performing 
garbage collection by deleting from the thread heap, when 
memory space in the thread heap is required, one or more 
local objects which can not be traced to the thread 
stack. 

The platform in the embodiment does not need to be 
Pentium based nor is restricted to the hardware stated. 
Any platform which is capable of supporting JDK vl . 1 or 
vl.2 would be capable of supporting the virtual machine 
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of embodiment. Although the embodiment is described with 
reference to a Java Virtual Machine it is not necessarily 
so restricted to a virtual machine but may be used in any 
environment where storage is garbage collected to free up 
memory in that environment. For example, such an 
environment could be an multithreaded Lisp runtime or the 
runtime for some other multithreaded garbage collected 
language . 

Although we have described in the preferred 1 
embodiment the local/global status as a field in the 
object it may be implemented in other ways for instance 
as a separate table. 

Java and Java-based marks are trademarks or 
registered trademarks of Sun Microsystems, Inc. 

Microsoft, Windows, and Windows NT are trademarks of 
Microsoft Corporation . 

Now that the invention has been described by way of 
a preferred embodiment, various modifications and 
improvements will occur to those person skilled in the 
art. Therefore it should be understood that the preferred 
embodiment has been provided as an example and not as a 
limitation . 



